Summarizing Audiovisual Contents of a Video Program

نویسنده

  • Yihong Gong
چکیده

In this paper, we focus on video programs that are intended to disseminate information and knowledge such as news, documentaries, seminars, etc, and present an audiovisual summarization system that summarizes the audio and visual contents of the given video separately, and then integrating the two summaries with a partial alignment. The audio summary is created by selecting spoken sentences that best present the main content of the audio speech while the visual summary is created by eliminating duplicates/redundancies and preserving visually rich contents in the image stream. The alignment operation aims to synchronize each spoken sentence in the audio summary with its corresponding speaker’s face and to preserve the rich content in the visual summary. A Bipartite Graph-based audiovisual alignment algorithm is developed to efficiently find the best alignment solution that satisfies these alignment requirements. With the proposed system, we strive to produce a video summary that: (1) provides a natural visual and audio content overview, and (2) maximizes the coverage for both audio and visual contents of the original video without having to sacrifice either of them.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Automatic Summarization for Generic Audiovisual Content

Nowadays, with the explosion of multimedia content availability, the selectiveness of its consumption increases in importance. Audiovisual content is no longer brought to us only by the television, being available through many other systems, like Personal Video Recorders and Video on Demand systems, on each one’s Personal Computer or, obviously, in the Internet. The exponential growth of websit...

متن کامل

Experienced Quality Factors - Qualitative Evaluation Approach to Audiovisual Quality

Subjective evaluation is used to identify impairment factors of multimedia quality. The final quality is often formulated via quantitative experiments, but this approach has its constraints, as subject’s quality interpretations, experiences and quality evaluation criteria are disregarded. To identify these quality evaluation factors, this study examined qualitatively the criteria participants u...

متن کامل

Collages as Dynamic Summaries of Mined Video Content for Intelligent Multimedia Knowledge Management

The video collage is a novel effective interface for dynamically summarizing and presenting mined multimedia information from video collections. We will discuss how collages are automatically produced, illustrates their use, and evaluates their effectiveness as summaries across news stories. Collages are presentations of text and images extracted from multiple video sources. They provide an int...

متن کامل

Audiovisual production invariant searching

Information searching in non-textual media is a fundamental point of interest, especially in the audiovisual industry where there is still an important need of tools for manipulating multimedia contents. In video documents, the style signature extraction is a highly interesting process since it provides a new feature for contents classification. Video documents may have very different character...

متن کامل

Semantic transcoding of video based on regions of interest

Traditional transcoding on multimedia has been performed from the perspectives of user terminal capabilities such as display sizes and decoding processing power, and network resources such as available network bandwidth and quality of services (QoS) etc. The adaptation (or transcoding) of multimedia contents to given such constraints has been made by frame dropping and resizing of audiovisual, ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • EURASIP J. Adv. Sig. Proc.

دوره 2003  شماره 

صفحات  -

تاریخ انتشار 2003